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Abstract 

We prove that any graph G with n points has a distribution T over spanning trees such that for any edge (u, v) 
the expected stretch ETr^T[dT{u, v) /dciu, v)] is bounded by 0(log n). Our result is obtained via a new approach 
of building "highways" between portals and a new strong diameter probabilistic decomposition theorem. 

1 Introduction 

Let G = (y, E) be a finite graph. For any subgraph H = {V' , E') of G let dn be the induced shortest path metric 
with respect to H. In particular, for any edge (it, v) G E and any spanning tree T of G, driu, v) denotes the shortest 
path distance between u and v in T. 

Given a distribution T over spanning trees of G, let stretchrl^t, v) = Et^t and let stretch7-(G) = 

max(„ stretch7-(M, f )• Let stretch(n) = max(5_(y^^-)||y|^„ inf7-{stretch7-(G)}. 

Initial results were obtained by Alon, Karp, Peleg and West [2] showing that 
ri(logn) = strctch(n) = exp(0( Vlog n log log n)). The upper bound was significantly improved to 0( (log n)^ log log n) 
by Elkin, Emek, Spielman and Teng [10]'. For the class of Series-Parallel graphs Emek and Peleg [11] obtained a 
bound of 6(logn). The main result of this paper is a new upper bound on strctch(ri,) that is tight up to polylogarithmic 
factors-. 

Theorem 1. 

stretch(n) = O {\ogn ■ log log 7i • (log log log n)^) 
Remark 1. For ease of presentation we first show a slightly weaker bound of 

stretch(n) = O (logn • (log log n)^ • log log log n) , 
and prove the tighter bound in Appendix B 

Our result may be applied to improve the running time of the Spielman and Teng [16] solver for sparse symmetric 
diagonally dominant linear systems. 
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'in fact these result apply to a similar notion, avg — strctch(n) = maxg—jy g)| |y |_„ infj^l^gj v)eE dG(u'ij) which is equivalent 
up to a constant factor to stretch(n). 

-[9] announced strctch{n) = 0{(log n)^), but this claim was subsequently withdrawn by the authors 
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1.1 Techniques 



We extend the star-decomposition technique of Elkin et. aZ. [10]. A star-decomposition of a graph is a partition of the 
vertices into clusters that are connected into a star: a central cluster is connected to every other cluster by a single 
edge. As in [10] given a subgraph over a cluster X, the central cluster Xq is formed by cutting a ball with radius 
ro around a center xq and the remaining clusters Xi , X2 , ■ ■ ■ , which are called cones, are formed iteratively. Let 
Yj = X \ lJo<fe<j "'^fc- cone Xj is created by choosing an edge {yj,Xj) such that yj G XQ,Xj £ Yj-i and 
defining Xj as the cone with radius rj around Xj from the cluster Ij-i, as all the points whose distance to xq going 
through the edge [xj, yj) does not increase too much relatively to the shortest path distance, formally Xj = {.x S 
y,_i I dx{xo,yj) + dxiyj,Xj) + dYj_i{x.j,x)-dxixo,x) < r^}. Letradxa{X) = max^jgx d(a;o, 2^), then typically 
the radius of the central ball is chosen so that ro « ladxfX-^) / for ^ constant c. An important parameter of a star- 
decomposition is the radius of the cone. We say that the star-decomposition has parameter e if for any j > 1, the 
radius rj of the cone Xj is at most e ■ radx„ {X). 

Applying star-decompositions in a recursive manner induces a spanning tree T. For a point u denote by X'*' the 
cluster that contains u in the ith recursive invocation of the hierarchical star-decomposition algorithm. 

The 0(log^ n log log n) bound of [ 10] is obtained by choosing e sa 1/ log n and showing: 

1 . 0(1) radius stretch. For any cluster X induced by the recursive invocation of the hierarchical star-decomposition 
algorithm, and any z G X, dxixo, z) = 0(radj;„ (X)). 

2. 0((logn • loglogn)/e) decomposition stretch. For any edge (w, v), 

^jPr[(u,w) is separated when star-decomposing X*^*'] • diam(X'^*') = O(lognloglog72)/e. 

Combining these two properties yields their result, noticing that if the end points of an edge (m, v) fall into different 
clusters in the partitioning of X'^^^ then driu, v) can be bounded by driu, xq) + driv^ xa) = 0(diam(X(*))). 

Good radius stretch is obtained by observing that in each recursive application of the star partition the radius of a 
cluster is stretched by at most 1 -I- 1/logn, and since there are O(logri) scales the total radius stretch is a constant. 
Good decomposition stretch is obtained by using a version of the decomposition of [4, 8]. 

Better radius stretch. In our scheme we perform a star-decomposition with a parameter e « 1/ log log n, this 
significantly improves the decomposition stretch, by a factor of « log 71/ log log n. A naive attempt to bound the 
radius stretch, by 1 + 1/ log log n in each scale, will result in super logarithmic radius stretch over all scales. 

We introduce a new approach to bound the radius stretch. We arrange all the points of X in a queue Q = 
(zi, Z2, . . . , Zn), and bound the distance drixQ, Zi) as a function of i by building "highways" - low stretch paths. 
Roughly speaking, we obtain a bound of dT{xo,Zi) = O(loglogi) • radj;„(X). The core observation is that by 
choosing where to build the first cone and passing this information into the recursion, one can obtain a shortest path 
"highway" between xq and the first point zi, such that the distance between xq and zi in the tree will be exactly the 
original distance in the graph. The challenge is to use this observation to maintain "highways" - low stretch paths - 
between xq and all the points. Specifically, we obtain 

1. O(loglogn) radius stretch. For any cluster X, and any z G X, dxixo, z) — 0{\og\ogn)r&dxa{X). 

Better decomposition stretch. A relaxation of the spanning tree problem suggested by Bartal [3] is to consider a 
distribution of dominating tree metrics (in fact of ultrametrics) that do not necessarily span the graph. This relaxation 
has proven applicable for approximation algorithms, online problems and has contributed to recent solutions for the 
spanning tree problem {i.e. [10]). Initially O(log'^n) approximation was obtained in [3] based on the truncated 
exponential distribution approach of [14]. This bounded was subsequently improved to O(lognloglogn) in [4] and 
[8]. Finally an optimal 0(log n) approximation was obtained by [12] based on the cutting scheme of [7]. Subsequently 
an O(logn) bound was also obtained using a truncated exponential distribution approach [5, 1]. 

However, all previous schemes that obtained the optimal 0(log n) bound for the metric problem were insufficient 
for the spanning tree problem. Given a graph G ~ {X, E), a sequence xi,X2, . ■ . of cluster centers and a sequence 
ri,r2, ■ ■ ■ of radiuses we can define a weak diameter decomposition by defining Wi = Bx{xi, r.i) \ Uj<i ^i- We 
can define a strong diameter decomposition by defining Ci ~ Bx\\j . Cj{xi, fi). Observe that in a strong diameter 
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decomposition, for any nonempty cluster Ci, we have that Xi G Ci and Ci is a connected component of G, this 
may not be the case for weak diameter decompositions. Indeed the techniques of [12, 5, 1] provide a weak diameter 
decomposition. It was not clear how to extend these results to strong diameter decompositions that are necessary for 
star-decompositions. We show how to obtain a strong diameter hierarchical decomposition theorem that obtains an 
optimal bound in the following sense: 

2. 0(lognlog(l/e)/e) decomposition stretch. For any edge (u, w), 

Pr[(u, v) is separated when star-decomposing X^*)] • diam(X'^*'') = 0(log n log(l/e)/e). 

As in [5, 1], our decomposition is based on the truncated exponential distribution with a parameter depending on 
the local growth rate of the space. The main technical difficulty arises since the space changes after each cluster is 
cut (the metric is derived from a graph, and some nodes and edges are removed at every cut). The idea is to define 
the local growth rate with respect to the current metric, and to show two things: that the expected sum of all growth 
rates (which are random variables) over all the scales telescopes to n, and that the probability to be cut is appropriately 
bounded in each scale. Dealing with the randomly changing graph raises some additional subtleties in the proof. Our 
strong diameter hierarchical decomposition theorem may be of independent interest. 

1.2 Applications 

One of the main apphcations of low stretch spanning trees is solving sparse symmetric diagonally dominant linear 
systems of equations. This approach was suggested by Boman and Hendrickson [6] and later improved by Spielman 
and Teng [16]. Spielman and Teng showed an algorithm that for such an n-by-n matrix A with m non-zero entries and 
an n-dimensional vector 6, if e > is the precision of the solution then the algorithm finds x' such that 1 1 x — 1 1 a < e 

where Ax ~ b, and the running time is O (^m ^log'^^^-' m + log(l/e)^ + n ■ avg — strctch(ri) • log(l/e)^ . Improv- 
ing the bound requires improvement of the second element, and we improve it by roughly an additional 0(log log n) 
factor over [10]. Actually, if the running time of our construction is reduced, we can obtain an O(logn) improvement. 
For planar graphs we obtain 0{n ■ log^ n). See details in Corollary 6. 

The minimum communication cost spanning tree problem introduced in [13], in which one is given a weighted 
graph G = (V, E, w) and a matrix A = Uxy \ x,y G V, the objective is to find a spanning tree minimizing c{T) ~ 
J2x y<£V ' '^T^x, y). [15] showed an 0(2^^^°s"''°s log") approximation ratio based on [2], and [10] improved to 
0(log^ n ■ log log n). Our results can be used to obtain 0(log n ■ log log n(log log log nY) approximation ratio. 

See [10] for details about more applications. 

1.3 Structure of the Paper 

In Section 2 we describe a star-decomposition framework, that for any unweighted n point graph G induces a tree 
such that diam(T) < C>(diam(G') • log log n). In Section 3 we describe a distribution on star-partitions that fol- 
lows the framework of Section 2. We analyze the expected stretch of an edge and prove the bound of strctch(n) = 
0( (log n ■ (log log nf' ■ log log log ??) . In Appendix A we discuss briefly how to extend the result for weighted graphs. 
In Appendix B we show the tighter result stated in Theorem 1 . 

2 Highways 

Let G = (V, E) be a finite graph. For any X C V let dx '■ be the shortest path metric induced by the 

subgraph on X. Let diam(X) = maxj^^rgx{'^x(j/, z)}. For a; g X let rada;(X) = maxygx dx{x, y), we omit the 
subscript when clear from context (note that diam(X)/2 < rad{X) < diam(X)). For any x G X and r > let 
Bx.d{x, r) ~ {y E X \ dx{x, y) < r}. Let c = 2^^ be a constant. We use the uppercase letter Q to denote a queue, 
a sequence of points. Given a point x not in the queue we say that we enqueue x into Q meaning that we add x as the 
last element of the sequence and given a queue Q, the dequeue operation removes and returns the first element of the 
sequence. 
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Definition 1 (cone metric-'). Given a graph G ~ (V, E), subsets Y G X QV, points x & X\Y , y £Y define the 
cone-metric p = p{X, Y, x, y) : Y^ ^ R+ as p{u, v) = \{dx{x, u) — dyiy, u)) — {dx{x, v) — dyiy, v))\. 

Note that a ball BY,p{y, r) in the cone-metric p — p{X, Y, x, y) is the set of all points z e y such that dx{x, y) + 
dviy^z) - dx{x,z) < r. 

Hierarcliical-Star-Partition algoritlim. See Figure 1 for the algorithm. Given an unweighted graph G = {V, E), 
create a spanning tree T = (V, E') by choosing some xq G V, letting Q be an arbitrary ordering of V" \ {xo} and 
calling: hierarchical-star-partition(y, xq, Q). 

T = hierarchical-star-partition(X, Ko, Q): 

1. Ifrada;o(^) < 16c return BFS(X). 

2. (Xo, . . . ,Xm, (yiyXi),..., {ym,Xm), Qo, Qi, • • • , Qm) = star-partition(X, a;o, Q); 

3. For each i £ [0, . . . , m] : 

4. Ti = hierarchioal-star-partition(A^i, Qi); 

5. Let T be the tree formed by connecting To with Ti using edge {yi,Xi) for each i £ [1, . . . , m]; 

Figure 1: hierarchical-star-partition algorithm 



Star-Partition algorithm. See Figure 2 for our star-partition algorithm. We highlight the main differences 
of our algorithm from that of [10]. In addition to X, xq it receives as input an ordering of the points in X, implemented 
as a queue data structure and denoted by Q. In addition to returning a star decomposition Xq, Xi, . . . , Xm it returns 
for each < j < m an ordering of the points in Xj, implemented as a queue data structure and denoted by Qj. 

Since as noted above the trivial radius bound (loosing (1 + e) in every scale) does not work anymore we attempt 
to directly bound drixQ, z) for all z £ X. The arrangement of X \ {xq] in a queue Q = (zi, . . . , Zn-i) determines 
"how hard" we try to give a tight bound for the point Zi - roughly speaking the smaller value of i means the harder we 
try to give a better bound on dxixo, Zi). The star partition algorithm therefore changes to try hardest for the first point 
zi, and indeed by choosing the first portal edge xi) on a shortest path to zi and keeping zi, yi in the head of the 
recursive queues we obtain a "highway" from xq to zi, i.e. preserving the original distance. Surprisingly, this small 
change is enough to give a good bound on c?T(a;o, z^) for alH > 1, and we obtain dT(a;o, -^i) = 0{\og\ogi)Ya,dxo{X). 
The intuition is that since every cluster contains less points, Zi advances in the recursive queues, and when it becomes 
the first we get a "highway" to it. For this intuition to work one must delicately define the ordering of the queues 
Qoj ■ ■ • , Qm for the clusters Xq, . . . , Xm created by the star partition algorithm. The main difficulty is defining Qq, 
as the portals yj play a dual part - we need to maintain their original position in Q and also make sure that the tree 
distance to them is small enough: as it determines the distance from to all the points in Xj . 

Suppose Zi G Yj for some i> 1. By Claim 2 there is an inherent loss of a 1+e factor due to star-partition algorithm. 
Hence it is not sufficient for the inductive argument to simply obtain a bound of drixo, yj) = 0(log logi)rad2;o (-^o) 
in the ball Xq and dT{xj,Zi) = 0{loglogi)Tadx {Xj) in the cone Xj. We must "gain" inductively either in 
dxixo, yj) (the ball part of the path) or in dxixjjZi) (the cone part of the path). This is done by choosing the queues 
in the following manner: Given a star decomposition Xq, Xi, . . . , X,n we create the queue Qj for j > simply as the 
restriction of Q on Xj \ {xj}. The queue Qo is the created by first adding either zi or the portal yi which is chosen 
on a shortest path to zi, thus making sure the distance from xq to zi is preserved in the recursion. Then interleaving 
three different queues Qq*"""^ , Qq"'^^ , Qq '^^^ . 

• Qo'^"^ is the restriction of Q on Xq. This queue provides the required bound on dxixo, Zi) when Zi G Xq. 

• Qq is a queue of portals yj ordered by the minimal point of Q that their cones Xj contains. When a cone 
contains relatively few point we "gain" in the cone part of the path to z^. This queue guarantees that for any 
Zi G Xj the "central ball" part of the path to Zi is not stretched too much. 

^In fact, the cone-metric is a pseudo-metric. 
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• Qg is a queue of portals yj that lead to cones that contain "many" points relative to the ordering Q of the 
points in Xj. When a cone is "fat" we cannot gain in the cone part, this queue guarantees that we gain in the 
ball part. 



The exact way these three queues are created is detailed in Line 5 of Figure 2. 

{Xo, . . . , Xm, (yiyXi),.. . , {ym,Xm),Qo,Qi, ■ ■ ■ ■ Qm) = s t a r-p a r t i t i On (X, , Q): 

1. Let J = 2; Denote the (ordered) elements of Q by Q = {zi, Z2, . ■ ■ , Zk); Let e = e{X) £ (0, j^]; 

2. Creating the ball Xq: 

(a) Choose ro uniformly at random from the interval [1 / (16c) , 1 / (8c)] ; 

(b) LetXo = B{xo,ro ■ rad:,o(X)); Let Yo ^ X \ Xq; 

3. Creating the first cone Xi! 

(a) If zi G Yo let z — zi otherwise let z £ Yohe an arbitrary point. Let {yi,xi) be an edge such that 
yi G Xq, xi G Yo and dx{xo, z) = dx{xo, yi) + dx{yi,xi) + dY„{xi,z) (i.e. an edge on a 
shortest path from xq to z); 

(b) Let p = p{X, Yo,xo, xi) be the cone-metric; 

(c) Choose ri uniformly at random from the interval [e/4, e/2]; 

(d) Let Xi = B^Y„,p){xi,ri ■ rad,„ (X)); Let Yi = Yo \ Xi; 

4. Creating the remaining cones X2,. . . ,Xm- 

(a) While y,_i / : 

i. Let (xj, yj, rj) = cone-cut(X, xq, Xq, Yj-i,e); (has the property that rj < e/2) 

ii. Let p = p(Yj-.iU Xo,Yj-i,xo,Xj)- 

iii. LetX^ = B^Yj_i,p)ixj,rj ■ iad^„{X));Yj =Yj.i\Xj; 

iv. Let j = j + 1; 

5. Creating the queues Q^,'"'"' , qJ,'"\ Qi, . . . , Q™: 

(a) Fori = 1, . . . , ]X| - 1 : 

i. If Zi G Xq then enqueue Zi into Qq''''"' ; 

ii. Otherwise let ^ > 1 be such that Zi £ Xf. 

• If Zi ^ xe then enqueue Zi into Qi. 

• Ifye ^ Qo"^*' then enqueue ye into Q^'^K 

• If \Xi n {zi, . . . , Zi}\ > \/i and yi ^ Qg'"'' then enqueue ye into Qg'"'^. 

6. Creating the queue Qo: 

(a) Denote Qq ' = 2:1 , . . . , 2„ ^ , Qq ' = Zi , . . . , z^^ , Qq = , . . . , z^^ . 

(b) Create Qo by interleaving the three queues Qq''''"'' , Qg''"' , Qq"^^'' such that: 

• If 2i G Xq then Zi is the first element of Qq. Otherwise j/i is the first element of Qq. 

• For any a:: G X, ^ G {1, 2, 3}, I < i < nif x = zf then x is in the first 3i elements of Qo- 

Figure 2: star-partition algorithm 



2.1 Bounding the radius stretch 

In this part we show that the radius stretch induced by the hierarchical-star-partition algorithm is at 
most 0(log logn). 
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The following two claims imply that the star-partition algorithm on a cluster X induces a partition on X 
and that radial distances are stretched by a most 1 + e. These claims are essentially proven in [10] we provide a proof 
for completeness. 

Claim 1. For any graph X, a:o G X, j > let Yj^i Q X be the unassigned points of X after creating j clusters 
Xq, . . . , Xj^i using the star-partition algorithm, then for any z G i^-i all the shortest paths from z to .tq are 
fully contained in Yj-i U Xq, in particular 

dY,-iuXo{xo,z) = dx{xo,z). 

Proof. Let A = rada;^ (X). Let Pz,xo be a shortest path and assume by contradiction that Pz,xo ^ ^-i U Xq, so let 
1 ^ * < J — 1 be the minimal i such that there exists u G Pz,xo ^nd u £ Xi. Let Xi be the portal to the cone Xi. By 
Definition 1 since u G it must be that in the metric d' = dxoUYi^i 

d'{u, Xq) + r.i ■ A> d'{u, Xj) + d'{xi,xo). 

Since u lies on a shortest path from z to xq, the minimality of i suggests that this shortest path is fully contained in 
Yi^i U Xq thus d'{z, xq) — d'{z, u) + d'{u, xq), and conclude that 

d'{z, Xq) + Ti ■ a = d\z, u) + d'{u, Xq) + ?'i • a > d'{z, u) + d'{u, Xi) + d'{xi,xo) > d\z, Xi) + fi'(x'i, xq), 

hence z should be in Xi, contradiction. □ 

Claim 2. Lef (Xq, . . . , (yi, xi), . . . , (y™, x^), Qo, Qi, ■ • ■ , Qm) = star-partition{X,xo,Q) then for 
any I < j < ni 

rad:ro(-'^o) +d{yj,Xj) +ra.dx^{Xj) < (1 + ejrad^i.^ (X), 

Proof. Let A = rad^o {X). Let /3 be such that radj;^ (Xq) = /S ■ A, let d' = dx„uY-^i, let xj be the portal of Xj and 
p = p{Xo U Yj-i, Yj-i, Xq, Xj) be the cone-metric. Take z G Xj as the farthest point from Xj (with respect to d'), 
take any shortest path Pa;^ 2 from Xj to 2 and separate it into consecutive segments Xj = uo,vo,ui,vi, . . . ,Uk,Vk = z 
such that for any < i < fc, p{ui, Vi) — 0, i.e. 

d'(xo, Ui) - d'{xj,Ui) = d'{xo, Vi) - d'{xj,Vi) 

and {vi, Ui+i) G E (note that it could be that Ui = Vi). The definition of cone-metric suggests that A: < • A, as 
otherwise z ^ BY._^^p{xj, rj - A) ~ Xj. 

Since Pxj.z is a shortest path we have for all < i < A: that d'{xj,Ui) + d'{ui,Vi) — d'{xj, Vi), therefore 

k k 

^(i'(xo,Vi) = ^((i'(xo,Wj) + d'{ui,Vi)). (1) 

i=0 1=0 

Claim 1 suggests that dx{xQ, z) ~ d'(xQ, z), hence 

A > dxixQ,z) = d'{xo,z) = d'{xo,Vk) 

k-l 

= X] {d'ixo, Ui) + d'{ui,Vi) - d{xo,Vi)) + d'{xo, Uk) + d'{uk, Vk) 

i=0 
k-l 

- X! (^'(^o,"i) + d'{ui,Vi) - (d'{xo,Ui+i) + d'{vi,Ui+i))) + d'{xo,Uk) + d'{uk,Vk) 

1=0 

k-l 

= d'{xo,uo) - d'{xo,Uk) + ^{d'{u^,v^) - 1) + d'{xo,Uk) + d'{uk,Vk) 

4=0 

k 

= {f3A + l)-k + Y,d'{u„v^) 

1=0 
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The second line follows from (1), the third from the fact that d'{xo, Vi) < d'{xo,Ui+i) + d'{ui+i, Vi), the fourth since 
the sum telescopes and d' {vi,Ui+i) — 1, and the fifth since d'{xo,uo) = d'{xo,Xj) = d'{xQ,yj) + d'{yj,Xj) = 
rad,„(Xo) + l=/3A + l. 
Therefore 

k fc-1 

ladx^iXj) =d'{xj,z) = Y,d'{ui,v^) + ^d\vi,u^+i) < (A - /3A + fc - 1) + < (1 - /3)A + 2r^A - 1, 

(recall that k < rjA). And now since rj < e/2, 

rad^o(Xo) + d{yj,Xj) + rad^,^. (X^) < /3A + 1 + (1 - /3)A + eA - 1 = (1 + e)A. 

□ 

Corollary 3. For any < j < m, rada;^. (Xj) < (1 — ■^)TSidxo{X) 

Proof. The corollary is immediate for Xq by the construction, for j > 0: as Ta.dxo{Xa) > Ta.dx„{X) / {16c) and 
e < l/(170c) using Claim 2 

rad,^ (X,) < (1 + e)rad,„(X) - rad,„(Xo) < (1 - l/(20c))rad,, (X). 

□ 

Lemma 4. Let X QV be a connected component of GiV^ E). Let Xq Cz X and Q = {zi, . . . , z\x\-i) be any ordering 
of X\{xq}. Let T be any spanning tree ofG returned by the algorithm hierarchi cal- star-part it ion{X, xq, Q) 
with parameter e ^ e{X) = i70ciog^iog(|X|) ' 

{dx{xo,Zi) 1 = 1 

i ■ ladxo i^) I < i < c 

c ■ log log i ■ radaij, {X) otherwise 

(where c^2^^} 

Proof. The proof is by induction on the radius of X. In the base case when radaj^ [X] < 16c create a breadth first tree 
centered in xq, and since in such a tree for every z G X, dx{xo, z) = dxixo, z) the claim holds. Now we turn to the 
inductive step. Note that Corollary 3 guarantees that for all j = 0, . . . , m we have < rad^^ i^j) < "^^'^x,, {^)- 

The main idea of the proof is to consider a single application of the star-partition algorithm, partitioning X into 
Xo, Xi, . . . , Xm. Assuming that z, e Xj the path between to Zi will be the path going through the edge [yj, Xj). 
Then use the induction hypothesis on the sub-path xo,yj in Xo and the sub path Xj, Zi in Xj. Since by Claim 2 the 
radius may increase by a factor of at most 1 + e, we need to "gain" in one of the two sub paths. This "gain" will occur 
since our construction guarantees that either the position of Zi in the queue of Xj will improve or the position of yj in 
Xq will improve, thus the induction hypothesis will give the required bounds. 

There are three main cases to consider, when i = 1, i < c and i > c. The case i = 1 is simple. The case 1 < i < c 
subdivides into three more cases: 

1. The first case is Zi e Xq. This case is relatively straightforward. 

2. The second case is that the first i points of the queue are all in Xi. Here we gain in the central ball because the 
portal yi leading to Xi wiU be the first element in Qo- 

3. The remaining case is that not all of the first i points are in Xi, then there are at most i — 1 points in the cone Xj 
among zi, . . . , Zi, so by the construction of Qj, we gain just enough in the cone (because the bound that needs 
to be shown is weak - linear in i) and Qq"^*' guarantees that we do not lose too much in the central ball. 

The interesting case is when i > c, this last case also subdivides into three more cases: 
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1. One first is that Zi G Xq. Again, this case is relatively straightforward and uses the construction of Qq''^"''. 

2. The second case is that z,; e Xj and Xj is a "thin" cone - contains less than VI of the first i points. Here we 

gain in the cone because the position of Zi in Qj is at most \/i and Qo"^*"* guarantees that we do not lose too 
much in the central ball. 

3. The third case is that Zi G Xj and Xj is a "fat" cone - contains more than ^/i of the first i points. Here we gain 

in the central ball, using the construction of Qo*^^'' and Claim 5 to show that the portal yj leading to the cone is 
in position < i^/^o in Qq. 

We continue with the formal proof of the lemma, according to the three main cases. 

Case 1: In this case i = 1. Note that zi £ Xq U Xi. If zi G Xq then by the construction zi is going to be the 
first in Qo therefore by the induction hypothesis on Xq it follows that dxixo, zi) < dx{xo, zi). If on the other hand 
zi G Xi, then again from the construction the point yi, which was chosen such that yi, xi are on a shortest path from 
Xq to zi, will be the first in Qo, and zi will be the first in Xi, so by induction dxixo, zi) = (ccg, + dxiyi, Xi) + 
dT{xi,Zi) < dx{xo,yi) +dx{yi,xi) +dx{xi,zi) = dx{xo,zi). 

Case 2: The second case to consider is when 1 < i < c. 

1. First assume that Zi G Xq. Then Zi will be at most i in the ordering of Qq''^"^ and hence at most 3i in the 
ordering of Qo- By the induction hypothesis on Xq '■ drixo, Zi) < cloglog(3i) • ladxaiXo) < i ■ T:a,dxg{X), 
using that radxo i-^o) < rada;^ (X) / (8c), and that log log(3i) < 2i. 

2. Now assume that {zi, . . . , Zi] C Xi. As yi is the first in Qo, by the induction hypothesis on Xq and Xi we 
have that dT{xo,yi) < dx{xo,yi) < rad:ro(^o) and drixi, Zi) < i ■ radj;j(Xi), so 

drixo, Zi) < drixo^yi) + driyi^xi) + drixi, Zi) 

< rad:ro(-'^o) + i ■ Ta.dx^{Xi) + dx{yi,xi) 

< i(rad^o(Xo) +dx{yi,xi) +Ya,dx,{Xi)) - (i - l)rad^(,(Xo) 

< i(l + e)rad,„(X)- (i- l)rad,„(X)/(16c) 

< i ■ rad,„(X) + I ■ rad,„(X)/(170c) - i ■ rad,„(X)/(32c) 

< i-rad^o(X). 

In the fourth inequality using Claim 2 and that rad^o (-'^o) > rad^ro (X) / (16c) (note that by the stop condition 
of hierarchical-star-partition Ta,dxg{X) > 16c, so ladxaiXo) > 1) and in the fifth that i — 1 > i/2. 

3. Now assume that Zi G Xj where not all of zi, . . . , Zi are in Xj (note that zi G Xq U Xi, therefore there is no 
case for {zi, . . . , Zi} C Xj where j > 1). First note that Zi must be at most the i — 1 element in Qj. By the 
insert sequence to Qq"^*^""* we have that yj is at most the 3i element in Qo- Using the induction hypothesis on Xq 
and Xj we get that 

dT{xo,Zi) < dT{xo,yj) + dT{yj,Xj)+dT{xj,Zi) 

< cloglog(3i) - TadxoiXo) + (i - 1) - Ta.dx^{Xj) + dx{yj,Xj) 

< (i - l)(radj;o(Xo) + dx{yj,Xj) + Ya.dx.{Xj)) + 5c - ladxaiXo) 

< (i - 1)(1 + e)rad,„(X) + 5c - rad,„ (X)/(8c) 

< i ■ mdx,{X) - mdxjX) + (i - 1) • rad,„(X)/(170c) + 5rad,„(X)/8 

< i-rad:ro(^)- 

Thethirdinequality follows since log log(3i) < loglog(3c) < 5. The fourth using Claim 2 and that rad^ro (Xo) — 
rad,„(X)/(8c). 
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Case 3: In the third case i> c. 

1. First assume that Zi e Xq. Then Zi will be at most i in the ordering of Qq'"'"^ hence at most ii in the ordering 
of Qo- By the induction hypothesis on Xo we get that 

dT{xo,Zi) < cloglog(3«) • rada;o(^o) < 2c log log i • rad^o(Xo) < clog log i • rad^rol-'^) • 

using that for i > c, 3i < i^. 

2. Next assume that Zi e Xj such that \Xj n {zi, . . . ,Zi}\ < then Zi will be at most the \/I in Qj, and yj will 
be at most the i-th in Qg"^^' and hence at most 3i in the ordering of Qq. By the induction hypothesis on Xq and 
Xf 

drixo, Zi 



< 




+ dT{yj,Xj) + dT{xj,Zi) 




< 


clog log (3 


.) • rad:ro(^o) + cloglog(%/i) • rad^j;^ {Xj) + dx 


{yj^Xj) 


< 


c(log logi 


+ 1) • rada;o(^o) + c(loglogz - 1) • ra.dx^{Xj) 


+ dx{yj,Xj) 


< 


c(log log i 


~ 1) (radj;o(Xo) + dx{yj,Xj) + rada.^(Xj)) + 


2c-rad,„(Xo) 


< 


c(log log i 


-!)(! + e)rad,„ {X) + rad,„ (X)/4 




< 


c log log i ■ 


rad^„(X) + clog log i • rad2;o(^)/(170cloglog 


-i) - c • rad:ro(^) + rad^o(^)/4 


< 


c log log i ■ 


rad:co(^), 





the fifth inequality using Claim 2 and that rada;^ (Xq) < Ta.dxo{X)/{8c), the sixth thate < l/(170cloglogi). 

3. The last subcase is where Zi e Xj such that \Xj n {zi, . . . , Zi}\ > Vi, then Zi will be at most the i in Qj and 
by Claim 5 yj will be at most the i^/^" in Qo. Now by the induction hypothesis, for t > 2 



dT{xo,Zi) < dT{xo,yj)+dx{yj,Xj)+dTixj,Zi) 

< clog log • rad^o(Xo) + clog log i • ra.dx.{Xj) + dx{yj,Xj) 

< clog log i(radj.„(Xo) + dx{yj,xj) + i-a.d.^^{Xj)) + clog(9/10) • rad^o(Xo) 

< cloglogi • rad^o(X) + e • cloglogi • rad^jjX) - c • rada;„ (Xo)/10 

< c log log i ■ rad,„ (X) + rad,„ {X)/170 - rad,„ {X)/160 

< cloglogi • radj:o(X), 

the fourth inequality using Claim 2 and the fifth that radj;„ (Xq) > rada:^ {X) / (16c) and e < 1/ (170c log log i) 



□ 



The following claim shows that a portal yj leading to a point Zi that belongs to a "fat" cone will be located in an 
improved position in the queue of the central ball Qo- 

Claim 5. For any i > 2^^, if Zi G Xj such that \Xj n {zi, . . . , Zi}\ > \pL then yj will be at position at most v'l^^ in 

Qo- 

Proof. We will show that yj will be in the first (3/2)z^/'^ + 1 elements of Qq'^^'^ Since i > 2^^ it follows that yj will 
be in the first 3 • ((3/2)^2/3 + 1) < i^/io elements of Qo- 

Let j/ij , . . . , yi_^ with ii < i2 < ■ ■ ■ < be a set of s points that were inserted into Q[f'"'' before considering the 
point Zi, we need to show that s < (3/2)i^/^. Let z^'^ , . . . , Zi'^ be the set of points in Q such that yi^ was inserted 
because z^/ G X,^. and Xi^ was a "fat" cone, i.e. \Xi^ n {zi, . . . , z^/ }| > -v/I^- Let Ai^ — Xi^ n {zi, . . . , z,/ } 



i'^^. Let A,, 

denote the set that caused yi^ to enter Qo '"\ and note that 1^^,^ | > > \/k. For any 1 < fc < £ < s we have that 
Ai^, n Ai^ = 0, since we do not insert a point y^^ that already appear in Qq*^^'', which implies X,,, n Xi^ = 0. Note that 
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all the sets Ai^ contain points from zi, . . . , Zi, so we have that J2k=i \^ik I ^ Hence J2k=i — J2k=i \^ik I — 
We also bound the sum from below 



k=i 



therefore i > {2/3)s^/^ or s < (3/2)^2/3. □ 

Corollary 6. For any weighted graph G = {V, E) denote by \ V\ = nand\E\ = m, invoking hierarchical-star-partition 
algorithm on G where in star partition algorithm we use the 

ImpConeDecompose{G, BS{xo,rQ ■ rad(X)), rad(X)/ loglogn, log log n, m) of [10], then we get a single span- 
ning tree T such that 

- E j4^<0(logn.(Ioglogn)3). 

The running time is 0{m log n) if G is unweighted and 0{m log n + n log^ n) ifG is weighted. 

Proof. Since our algorithm works in a similar manner to the [10] algorithm, we can use their partitioning method 
ImpConeDe compose, which has a a running time of 0{m) if G is unweighted and 0(m + n logn) if G is weighted. 
The only difference is that in the first iteration (j ~ 1), instead of picking an arbitrary portal xi we pick the node xi 
that is first on a shortest path from xq to the first in the queue Q. The average stretch of their cone cutting method is 
roughly 0(log n ■ log log n ■ 1 /e) (recall that e = 1/ log log n), and since the radius of our spanning tree increases by 
0(log log n), the corollary follows. It remains to see that our running time is no worse than [10], and indeed it is easy 
to see that adding the queues increase the run time only by a constant factor. □ 



3 Strong Diameter Probabilistic Partitions 



{x,y,r) — cone cut{X,xo,Xo,Y,e): 

• Let p £ Y be the point minimizing t-^ -, ^-^-^ — / v\ /ir-\\ over all 2 G Y; Let y denote that 

minimum; 

• Let {y, x) be an edge such that x £ Y , y a Xq and dx{xo,y) + dx{y, x) + dY{x,p) — dx{xo,p) (i.e. 
y and x lie on some shortest path between xo and p)\ 

• Choose r G [e/4, e/2] according to the following random process: 

- Divide the interval [e/4, e/2] into A'^ = [2 log x \ equal length intervals 5*1, ... , ^jv; Let h = 1; 

- LOOP: Toss a fair coin; If it turns out head and h < N then \eth = h + l and goto LOOP; 

- Choose r uniformly at random from the interval Sh. 

• Return (a;, y, r). 

Figure 3: cone-cut algorithm 

Consider a graph G = (V,£^), a connected cluster X C_ V , xq £ X and let A = ladxaiX). Fix some edge 
{u, v) € E. Let X'*) = X(')(it) be a random variable that indicates which cluster contains u in the i-th step of the 
hierarchical application of the star-partition algorithm"*. In a similar manner let Xq be the random variable indicating 

the center of the cluster X''', and when X*^*' is partitioned denote the central ball as Xq*' and cones as xf' , . . . Xm 
where m is a random variable depending on X*^'^. Let £j{X^''\ u, v) be the event that u,v <E X'*) and in the star- 
partition of the cluster X'*) with center x^'^ into X^'' , . . . , , u £ xf \ v ^ xf \ Let , u, v) be the event 
that 3 < j < TO such that £j (X'*' , u, w). Some notation: 

'*We abuse notation and think of X'*' as a function to subsets of X (instead of R). We also refer to X^'^ as an event. 
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Ea-(0 [f{X'^'^)] will stand for J2x' P^X'^^^ = X']f{X' 



Let T be the support of the distribution over spanning trees induced by the hierarchical star partition algorithm. 
Let T*^*' C T be the set of spanning trees for which event , u, v) occurs. 



E[dT{u,v)] < J2 Pr[T]-dT{u,v) 

i>l TeT(i) 



< y^E^w Pr\£ (X^'\u,v)] max {dT(u,v)} 



i>l 



The last inequality holds since for any < 2rad (i) (T) and using 

Lemma 4 we get that rad (o (T) < 0(loglogn • rad 

In what follows we bound E^w Pr[£: , w, u)] • rad^(,)(XW) . Lete = lyoe.iogiog | x| andfc = 20c(ln(l/e)+ 
5). The main lemma to prove is the following 

Lemma 7. For any graph G = {V, E), any edge {u, v) <E E, any connected cluster X*-*) C y we have that 

Wv[£{X^^ , u, v)] ■ rad^,., {X^^)] < C ■ d{u, v)/e ■ (e^,,., [log \X^^ |] - E;,(.+., [log | 
L -^0 J V 

where C is a universal constant. 

Once this lemma is proved, a telescopic sum argument yields that 

ndT(u,v)] < 0(loglogn)^E;,w [Pr[£(X«,u,z;)] •rad,„(X«)" 

i>i 



< 



0(log log n) ■ d{u, v)le ^ E^^ <.) [log \X^] 



< 0(log n ■ log log n) ■ d{u, v) ■ log(l/e) /e 

= 0(log n ■ (log log 7i)^ • log log log n) ■ d{u, v) . 

As we stated in the introduction, the algorithm of Figure 3 and proof of Lemma 7 are based on the truncated 
exponential distribution approach of [5, 1]. The main technical difficulty arises since the space changes after each 
cluster is cut. Dealing with the randomly changing graph raises some additional subtleties in the proof. 

We begin with some definitions and an informal description of the algorithm and the proof idea. Fix the edge 
(u, v) € E, Si scale i and X = X'-^\ Let F C X be a random variable indicating that there exists < j < m such 
that Y = Yj^i in the star partition of X. Define the local growth rate around x E Y with respect to Y as 



xiX,Y,x) 



\X\ 



\Bya^ ix,e A/ 16)\ 

The algorithm for the partition is as follows: Choose a radius for the central ball around xq from a uniform 
distribution in a range of size w A/c. The center xi is chosen on a shortest path to zi, the first point in the queue, and 
then the radius for the cone is again sampled from a uniform distribution in a range of size « eA. For j > 1 the jth 
center Xj is chosen on a shortest path to the point pj G ^-i minimizing Xj = x{X, and then the radius of 

the cone is chosen from a truncated exponential distribution, with parameter Xj- 

Denote the event that Y = Yj-i and u e Xj as Zj{X, Y, u), and let Z{X, Y, u) be the event that 3 < j < m 
such that Zj{X, Y, u). Note that fixing Yj_i determines deterministically pj and therefore also Xj and Xj- Similarly 
let Zj{X, Y) be the event that Y = Yj_i and Z{X, Y) the event that 3 < j < m such that Zj{X, Y). Let N{j) 
be the random variable that is the number of partitions Si, ... , S'7v(j) of the interval [e/4, e/2] for the jth cone. Let 
< h(j) < N{j) be the random variable that is the index of the interval S'/j(j) from which the radius rj is uniformly 
chosen for X j. Some more notation: 
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Eycx [/(>")] will stand for Y.ycx P42:{X, Y)] ■ f{Y) (we write Ey when X is implicit). 
Eycxj [f{Y)] will stand for Eycx P^i^j ^)] ' /(^) (we write Ey.j when X is impHcit). 
Eycx,u [/(>")] will stand for J^ycx ^^i^i^' ^> ")] ' /(^) (we write Ey,„ when X is impHcit). 

We divide the event £(X, u, v) into three cases (by symmetry we can define all these events with respect to u). 

• The first is the event that u falls into one of the first two clusters (the central ball Xq or the first cone Xi). This 
event is denoted by G{X, u). 

• The second is the event that u is contained in cluster Xj for some j > 1, such that the cone distance between 
u and the center Xj is in the last interval i.e. that p{xj,u)/A £ S^ij)- This event is denoted by !F{X,u). 
We partition the event J-{X, u) using the different values of j: For any j > 1 let J-j{X, u) be the event that 
p{xj,u)/A G 'S'jv(j); and note that !F{X,u) is simply that there exists j > 1 such that Tj{X,u) and also 

u e Xj. 

• The third is the completion of the first two events, that the cluster Xj containing u has j > 1 and p{xj , u) /A ^ 

The probability of the first event can be bounded simply by the inverse of the range from which the radius is drawn, 
so we obtain probability at most « i^^i 

For the second event we note that reaching the tail of the exponential distribution requires that — 1 fair coin 
tosses turned out head, which is bounded by « ^ « -i^, then since we choose uniformly from the last interval, 

Xj 

the probability that we separate m, u is « cAx^^"'"'* — t Ax""* ' ^^^^^ parameter Xj is a random variable which 
depends on the previous cone cuts, the proof becomes a bit more involved as we need to give a different bound for 
every possible Y = Yj-i- We show that for every star-partition X]j>i Xj^ — ^' hence this also holds in expectation 
and the second event probability is bounded by k, ffeii. This is shown in Claim 8 

Bounding the third event relies on the memoryless property of the exponential distribution. The major technical 
difficulty is that the bound we show depends on the parameter x- Hence we can only show the bound given some 
subspace Y from which we cut the next cone. The bound on the probability obtained here is sa ^°&x-d(u.v) ^ rpj^^^ 
shown in Claim 9. 

The last step is to sum over all scales i, and use a telescopic sum argument on the expectation of the values of the 
log X showing that they sum to 0(log(l/e) • log n). This is shown in the proof of Lemma 7. 

Claim 8. For any cluster X (-V, edge u,v € X, (u, v) G E, we have 

Pr[T{X,u) A£{X,u,v)] < 48d{u,v)/{eA) . 

Proof. Note that we can only bound the probability of event such as £j{X, u, v) given that some Y = l^-i is fixed 
i.e. that event Zj{X, Y) occurred (because the parameters Xj and Xj that govern the next cone creation are random 
variables depending on Y. So fix some Y = Yj^i and note that indeed pj, Xj and Xj = xi^: ^^Pj) are determined 
deterministically. 



Pr[T{X,u) /\8{X,u,v)] 

= Pr[3j >\,Tj, {X, u) A £j, (X, u, v)\ 
< J2Pr[£,iX,u,v)\T,{X,u)] 

j>2 YCX 
J>2 



12 



The first equation holds since the probability to be cut by a cluster whose radius is "large" is the probability that some 
cluster Xj with large radius separates u, v. The first inequality holds by the union bound and the second equation since 
for every event A and pairwise disjoint events Bi, . . . ,Bi with Pi'l-^i] = 1 it holds that Pr[yl] — X]i=i Pi'i-Si] • 
Pr[A I Bi\. Here the events B are Zj{X, Y) which are disjoint for different subgraphs Y . Note that events Tj{X, u) 
and Zj{X, Y) tell us nothing of the radius of the next cone Xj, therefore the probability of £j{X, u, v) given the 
subspaceYj_i and that /9(a;j , u)/ A e >S'jv(j) (where p = FUXq, d', Xq, Xj) is the cone metric), is the probability 
that h{j) — N{j) (recall that the random variable h{j) is the index of the interval Sh{j) from which the radius is 
uniformly chosen for Xj) and that the uniform choice in the interval S^ij) hits the place that separates u, v. To bound 
the first one 

Pr[/i(j) = N{j)] 2-(^(^'-i) < 2-2i°gX:-+2 = 4/x^ 



d{u,v) 
A|SiV0)l 



and the probability of the second event is 
These two events are independent, hence 

Pt[T{X,u) A£{X,u,v)] < 

< 



Note that IS* 



NU) I 



4[21ogXjl 



> 



Slog Xi+4 



> minjl, T—^ — I • rpr 

— L J log Xj J 12 



48d(u, v) 
e- A 

48d(u, v) 
£• A 



A7 At 



For any Y = (Fi,y2, ■ ■ ■ ,Yn) C X'^ let be the event Ai<j<„ 2:{X,Yj,j) (where is the jth component 
of y). Observe that for any j and F C X we have Pt[Z{X,YJ)] = EycX" y,=y Pr[^(^)]- Therefore 



J">1 FCX 

J2 Pr[Z{Y)]-Xj 



E 
E 

YCX 



Pr[ZiY)]J2-^ 



Now it is enough to show that for any Xo,Xi, . . . ,X„i that may occur in the start-partition algorithm (i.e. 
Pi[Z{Y)] > 0, given that Yj ~ X\[ji^j Xi) we have X^Jli — 1- This holds because for any 2 < £ < j < mwe 
have that B^^.d^, ipi, eA/16) C Xe, and Y^ 0X^ = 9, i.e. BY,,d^^ {p^, eA/16) n By^^d,-. {Pj,eA/16) ^ 0. Therefore 



Ex7' < \X\~'Y.BY„d,iP„eA/W) < 1. 



i=i 



□ 



Claim 9. For any cluster X C_V, edge u,v £ X, (w, u) e _E, subgraph Y d X we have 

Pt[£{X,u,v) a I A Z{X,Y,u)] < 12d(u, w) max{l, logx(X, w)}/(e • A) 

Proof. If w) > e • A/12 the the claim is trivial, so assume it is smaller Let j > 1 be such that the next cone to 
be cut is Xj (the value of j is not relevant, we fix it in order to simplify the notation), and recall that fixing Y = Yj-i 
determines deterministically pj, Xj and Xj- Let p = p{Xo U Y, Y, xq, Xj) be the appropriate cone metric on Y by 
which the next cone is cut. 



Pr[£'(X,u,t;) A | Z{X,Y,u)] < Pr[£j{X,u,v) /\ ^J'j{X,u) \ Z{X,Y,u) /\ Z{X,Y)] 

< Pr[£,{X, u, v) I pixj,u)/A i S^^^^ A Z{X, Y, u) A Z{X, Y)] 
Pr[£,{X, u, v) I pixj,u)/A ^ S^u) A Z{X, Y)] 



< 



Pv[Z{X,Y,u) I pixj,u)/A i 5jvo) AZ(X,y)] 
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The first inequality holds since event Z{X, Y, u) implies that u G Xj so the events £{X, u, v) and £j {X, u, v) are 
equivalent (the same holds for -^!F{X, u)), and because Z{X, Y, u) C Z{X, Y). The second is by the definition of 
J-{X, u) (given that u G Xj it cannot be that p{xj, u)/A falls in the interval S'^vq)), and since for any events A, B, 
Pt[A AB] < Pt[A I B]. The third is by Bayes rule and since £j{X, u, v) A Z{X, Y, u) = £j{X, w, v). Let £ be such 
that p{xj ,u)/A G Se. 

First we bound the denominator, noting that there is no prior information given about the distribution for the next 
choice of radius. Since £ < N{j) we can bound Pr[Z{X, Y, u) \ p{xj, u)/ A ^ '^N{j) A Z{X, Y)] > 2"^, since with 
this probability the radius for the cone Xj will be chosen from Sm ■ A with m > £ so it will large enough to contain 
u. The numerator PT[£j{X, u, v) \ p{xj,u)/ A ^ ^'atq) A Z{X, Y)] can be bounded by ■ i • which is 

the probability that we reach the £-th interval, not continue to the next one (note that the next interval exists because 
£ < N{j)) and when choosing rj uniformly from Se, it happens to be the place that separates u, v. The probability 
for the first event is 2^'^"^\ the second is 1/2, and the third is ^j^- Since \Si\ > min{l, ^^^^ } • it follows that 
Pii£j{X, u, v) I p{xj,u)/A i Snu) a Z{X, Y)] < i2d(^..t,)m^x^{i,iogx,-} ^ conclude that 

Ti iPtY ^ A Tfv \i V M ^ I2d{u,v) max{l,\ogXj} 
Pr[£[X,u,v) A ^JF(A,fi) | Z[X, Y,u)\ < — —. 

□ 

Proof of Lemma 7. Fix any i> 1 and X*^*^ = X*^*' (u). As described before we partition the event , u, v), given 

a fixed cluster X'*' into the three cases. 

Pi[£{X^'\u,v)] 

= Pr[£{X^'\u,v) A J'iX^'lu)] + Pt[£{X^'\u,v) A ^J'iX^'lu)] 

= Pr[£{X^''\u, v) A J'iX^'lu)] + Pr[£(X«, u, v) A g{X^'\u)] + Pr[£(X«, v) A u) A --GiX^'l 

The last equality holds since event Q{X^'^'' , u) implies that -^T{X''^\ u). We claim that the following hold: 

Pii£{X^'\u,v) AT{X('\u) I X«] < 48d(w,w)/(eA) (2) 
Pi[£{X'^'\u,v) Ag{X('\u) I < 5d{u,v)/{eA) (3) 

Pt[£{X^'\u,v) A ^T{X^'\u) A ^g{X^'\u) | X«] < 12d(u, w)/(eA) • Ey,„[max{l, logx(X, i^, u)}] (4) 

(2) holds directly from Claim 8. (3) since the radius of the central ball is chosen uniformly from interval of length 
A/(16c) > eA, and for the first cone from interval of length eA/4. (4) holds by using Claim 9 and writing 



Pt[£{X'^'>,u,v) A^T{X^'>,u) A^g{X'-'>,u)] < Ey,„ Pt[£{X'^'> ,u,v) A ^T{X^'> ,u) \ ^g{X^'> ,u)] 

< ^^) Ey.„[max{l, logx(X, Y, u)}] 

e ■ A 

Combining these three equation yields that for C = 65 

Pt[£{X'-'\u,v)] < C ■ d{u,v)/{eA) ■EY,u[^ax{l,logx{X,Y,u)}] . 

Recall that k = 20c(ln(l/e) + 5), and Corollary 3 suggests that for any cluster X and any j > that rad^^. (Xj) < 
(1 — l/(20c))rada;Q (X), hence for any event given that X'*^ happened 

rad(X(*+'^)) < (1 - l/(20c))'= • rad(xW) < e • rad(xW)/32, 

therefore diam(X < e-rad(X(*))/16 and by definition?/ e so fixing any F such that event F, ?i) 
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occurred then if X C y alsoX(^+'=) C By,^^ (u, e • rad(XW)/16). 

EY,u[logx{X^'\Y,u)] = log|xW| -Ey,„[log|By,d,.(M,e-rad(xW)/16|] 



< log|X«|-Ey,„ 



Pr[X(*+'^) I Z(X(*), y, u)] • log \X 



(i+fc)| 



log|X«| - J2 Pr[^'''+'''^ I XW] • log|X 



(i+fc)l 



X(»+'=)CX(') 



We conclude that 



Pr[£(X(*),u,w)] 



< 



log I - ^ Pr[X('+'=) I X'^)] • log I 

X('+'=)CX(') 



E_Y(„ [log |] - J2 Pr[^*'^] XI Pi-[^''+''^ \X^'^]- log I 
E;,« [log|xW|]-E^(,+., log|x('+^-)| 



□ 
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A Extending to Weighted Graphs 

In order for our algorithm to work for general weighted graphs, we will make the following change: After choosing 
the points Xj, yj in the cone-cut algorithm, create an imaginary point y'j which lies on the edge yj,Xj such that 
d{xo ,y'j) = radj;;, (Xo ), then return the point y'^ . Note that then the inequality ra-d^o {Xq ) + d{y'j ,Xj)+ rad^:^. {Xj ) < 
(1 + e)Ya,dxa{X) will hold, which is the only place we used the unweighted property of G. With a slight change to 
the algorithm the number of imaginary points added is at most the number of edges in the original graph G. This is 
because the point y'^ is connected only to yj in the central ball Xq, so if in the recursion depth when cutting a cluster 

X, the edge is cut by the central ball Xq, then the cone Xi created will contain only one point - yj = xg, so in such a 
case it will hold that rad^„ (^o) + d{yj ,yj)+ ra^dx^ {Xg) < (1 + e)rad:r,-| (X), and we will not add another imaginary 
point. 

The other change to the algorithm is contraction of small edges, following [10]. Let G = (V, E) be the original 
graph of size n. At every recursive step of hierarchical star partition for a cluster X with A ~ rad(X) 
we contract all edges shorter than cA / n for a constant c. Then these small edges will not be cut - it guarantees that 
every edge is at risk in at most 0(log?7) recursive steps. It remains to show that the radius does not increase - note 
that adding back all these edges will increase the radius by at most cA, and also note that our inductive proof actually 
has a slack of c'A , i.e. if we need to bound d-^ixo, z^) by i • A then we actually show that drixo, z^) < i ■ A — c'A. 
Now choosing c < c' will guarantee that even after expanding back all the edges we contracted the radius bound still 
holds. The last issue is the choice of portals in the expanded graph. If Xj is the super node in the j-th portal (recall 
that y'j is an added imaginary point), we choose Xj £ Xj which is connected to some vertex in yj and also lies on the 
shortest path from xq to z;. 
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B Improving the stretch slightly 



The factor of c log log i that was chosen as a bound on the radius increase in Lemma 4 was actually arbitrary. In fact 
we can replace it with almost any other monotone increasing function of i, the position in the queue. In order to 
optimize (asymptotically) the stretch, we take a very slowly increasing function of i, using the following definitions: 
Recall that log'"' n = n and for any integer 1 < t < log* n, log*-*' n ~ log ^log'*"^' . We use log* n = min{t | 

1 < log'*' n < 2}. For any integer 1 < t < log* n let Lpt{n) = 111=2 log'*^' n, (when t = 1 let ipi{n) — 1). 
The following two technical claims are proven in Appendix C. 

Claim 10. For any 0<a<l, i>4 and integer 2 < t < log* i, 

log'*' (z'^) <log'*'i + (loga)M_i(z). 
Claim 11. For any a > 1, i > 16 and integer 2 < t < log* i, 

log'*'(ai) < log'*' i + (2 log a)/ log 

The parameter c that was a constant can now be arbitrary number c > 2^®, i.e. it can be a function of We also 
use a different value of e = noc]pt{n) ^'^^ ^^^^ partition. Now the lemma that gives a tighter bound on the radius is 
the following: 

Lemma 12. Let 1 < t < log* c be an integer. Let {X, d) be the metric derived from an unweighted graph G = (V, E), 
Xq ^ X and Q = (zi, . . . , z\x\-i) ony ordering of X \ {xq}, also let T be the spanning tree of G returned by the 
algorithm hierarchical-star-part it ion{X^xo,Q) with parameter e = e(X, c, t) = 90eyt\|x|) ' ^'^^^ 

{d{xo, Zi) i = \ 

i ■ ia.dxo(X) 1 < i < c 

c ■ log'*' i ■ r&dx„ {X) otherwise 

Proof. The proof is by induction on the radius of X. Note that Corollary 3 guarantees that for all j = 0, . . . , to we 
have 1 < ra,dxj{Xj) < rada:o(X). 



Case 1: The case z = 1 is identical to Lemma 4. 



Case 2: The second case to consider is when 1 < z < c. 

1. First assume that Zi <E Xq. Then Zi will be at most i in the ordering of Qg*"^"' and hence at most 3i in the 
ordering of Qo- By the induction hypothesis on Xq : dxixo, Zi) < clog'*'(3i) • rada;o(Xo) < i • ^'S^dxgiX), 
using that rad^„ (Xq) < rad^jo (X)/(4c), and that log'*' (3z) < 2i. 

2. Next assume that {zi, . . . , z;} C Xi. As j/i is the first in Qo. by the induction hypothesis on Xq and Xi we 
have that dT(a:^o,yi) < dx{xo,yi) < ra.dxo{Xo) anddTixi,Zi) < i ■ rad2;i(Xi), so 

drixo^Zi) < dT{xo,yi) + dT{yi,xi) + dT{xi,Zi) 

< rada;o(^o) + i ■ rad2:i(Xi) + d{yi,xi) 

< i{Ta.dxo{Xo) + d{yi,xi) + i-a.dxi{Xi)) - (i - l)rad^o(Xo) 

< i{l + e)vadxa{X) - (z - l)rad,„(X)/(16c) 

< I ■ iadx„{X) + I ■ rad,„(X)/(170c) - i ■ rad.^ (X)/(32c) 

< i-ra.dxo{X). 

In the fourth inequahty using that rada;o(Xo) > rada;o(X)/(16c), in the fifth that — 1) < 2. 
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3. Now assume that Zi £ Xj where not all of zi , . . . , Zi are in Xj. This case further subdivides to two main cases, 
the second one divides to two subcases (this complication arises since c is not a constant anymore). 

(a) If i < c/4: First note that Zi must be at most the i — 1 element in Qj. By the insert sequence to Qq''^'^ we 
have that yj is at most the 3i < c element in Qo- Using the induction hypothesis on Xq and Xj we get that 

dTixo,Zi) < drixo^yj) +dT{yj,Xj) +dTixj,Zi) 

< i ■ Ta,da:oiXo) + («-!)• Ta.dxjiXj) + d{yj,Xj) 

< {i- l)(rada;o(Xo) + d{yj,Xj) + radx^iXj)) + rad^^oC^o) 

< (z- l)(l + e)rad,„(X) + rad,„(X)/(8c) 

< t ■ rad,„(X) - rad,„(X) + (i - 1) • rad,„(X)/(170c) + rad,„(X)/(8c) 

< i-ia.dxg{X). 

(b) Otherwise i > c/4, then there are two cases: 

• If \Xj n {zi, . . . , Zi}\ < ^/i then Zi will be at most the \/i in Qj and yj will be at most the 3i in Qo- 
Note that for c > 100, Vi < i/2, and also log'*' {3i) < i for all t > 1, hence 

drixo^Zi) < dT{xo,yj) +dTiyj,Xj) +dTixj,Zi) 

< c log'*' (Si) ■ rad,„ (Xo) + {i/2) ■ rad,^. {X^) + d{yj , Xj) 

< c-iTad^„(Xo) + (i/2)-rad^o(^) 

< iTad,„(X)/8+(i/2)rad,„(X) 

< i • rad:ro(-^), 

using that rada;o(Xo) < rad:ro(-'^)/(8c). 

• If \Xj n {zi, . . . , Zi}\ > then Zi will be at most the i-th in Qj and by Claim 5 yj will be at most 
the i^/i" in Qq. Note that i^/i" < i/2, then by the induction hypothesis 

drixo^Zi) < drixo^yj) + driyj^Xj) + dT{x.j,Zi) 

< {i/2) ■ radj;„(Xo) + i ■ rad^^ {Xj) + d{yj,Xj) 

< i ■ (rad^„ (Xq) + d{yj,Xj) + -rad^. (X^)) - {i/2) ■ rad^^ i^o) 

< i ■ rad^jo {X) + e-i- rad^^ {X) - i ■ rad^;,, {X)/ (32c) 

< i-i-adxo{X), 

using that rad^o(Xo) > radj;„ (X)/(16c). 



Case 3: The third case when i > c: 

1. If Zi e Xq then it will be at most i in the ordering of Qq*"^"' hence at most 3i in the ordering of Qq. By the 
induction hypothesis on Xq we get that 

dT(a;o,z,) <clog(*'(3i)Tad,„(Xo) < 2clog'*'(i) • rad,„(Xo) < clog'*' i • rad,o(X), 

using that for i > c,3i < i^ hence log'*' {3i) < 2 log'*' i for all t. 

2. The second case is when Zi g Xj such that \Xj n {zi, . . . , Zi}\ < Vi, then Zi will be at most the ^/i in Qj, and 
yj will be at most the i-th in Qq*^^' and hence at most 3i in the ordering of Qq. By the induction hypothesis on 
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Xo andXj, fort > 2 : 

drixo^Zi) < dT{xo,yj)+dT{yj,Xj) + dT{xj,z^) 

< c log(*) (3^) • rad,„ (Xq) + c log^*) (Vi) ■ rad,^ (X^) + d{yj , Xj) 

< c(log(*) i + 4/logi) • rad^„(Xo) + c(log(*) i - l/(^t_i(i)) • rad^,.(Xj) + d(yj,a;j) 

= c(log**^ i - l/.^t_i(i)) (rad:ro(-'^o) + d{yj,Xj) + rada,^(Xj)) + c(4/logi + l/(/9f_i(i)) • rad^^ 

< c(log(*) I - + e)rad,„(X) + (1/(2 logz) + l/(8(^t_l(^)))rad,„ (X) 

< clog(*) ^ • rad,„(X) + rad,„(X) • log^*) z/(170v?*(i)) - rad,„(X)/(pt_l(^) + 3rad,„(X)/(4^t 

< cW*) I • rad,„(X) + rad,„(X)/(170^t_i(i)) - rad,„(X)/(4^t_i(i)) 

< clog(*'^•rad,„(X), 

the third inequality using Claim 10 and Claim 1 1 . The fifth inequality holds since for every s > 1, log i > tps («)' 
and so Ta,dxo{X) / logi < ia,dxo{X) /(pt-i{i), and the sixth because log'*' i/ipt{i) = l/Lpt-i{i)- 
In a similar manner, it can shown that the same holds for t ^ 1. 

3. The last case is where Zi e Xj such that \Xj n {zi, . . . , Zi}\ > ^/i, then Zj will be at most the i in Qj and by 
Claim 5 yj will be at most the i^/^*' in Qq- Now by the induction hypothesis, for t > 2 



dT{xo,z.i) < dT{xo,yj) + d{yj,Xj) + dT{xj,Zi) 

< clog(*> i^/i" • rad,„(Xo) + clog(*) i ■ rad,^. (X,) + d{y,,x,) 

< clog'*' i(rad,.„(Xo) + d{yj,Xj) + rad^^(Xj)) + clog(9/10)/(^t_i(i) • rad,.„(Xo) 

< clog(*) i ■ rad,„(X) + e ■ clog'*) i ■ rad,„(X) - c • rad,„(Xo)/(10^t_i(i)) 

< clog'*) i ■ rad,„(X) + rad,„(X) • log'*' V(170<^t(0) " rad,„(X)/(160(pt_i(i)) 

< clog'*'^•rad,„(X), 

the third inequality using Claim 10. In a similar manner, it can shown that the same holds for t — 1. 

□ 

Proof of Theorem ] . Take c = 2^® log'*' ti (recall that t = (log*n)/2 and indeed t < log* c). Note that 1/e = 
0(c(/9t(n)) andtheparameterfc ~ 0(clog(l/e)) = 0(c log log log n). The increase in radius is rad(T) < 0(c^rad(X)), 
and plugging in these parameters to Lemma 7 implies that the expected stretch for any edge {u, v) E E is bounded by 

IET~rMT(u,w)] < O {c^ -logn- k/e) 

= 0{c'^ \ogn ■ \oglog\ogn ■ (pt{n)) 

= O (^log(i) n ■ log(2) n ■ log(3) n • • • log«'°s* "'Z^) n ■ (log«'°s* n) * • log'^' r?j 

□ 
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C Proof of some claims 

Proof of Claim 10. We prove by induction on t. The base case where t ~ 2 holds since log log (i") = log(alogz) = 
log log i + log a. Assume the claim holds for t and we prove for t + 1 

log(*+i^ (z-^) = log (log(*^ (i'^)) 

< log (log(*) ^ + (loga)/v3t_i(i)) 

= log (log(*) * • (l + {\oga)/{^t-i{i) ■ log(*) i))) 

< log(log(*)i- (2'°s''M(0^^ 

= log(*+i'i + (loga)MW. 

The first inequality uses the induction hypothesis, the last inequality holds because log a < 0, and 1 + a; < 2^ for 
a; < 0. □ 

Claim 13. For any c > 0, b > 2 and < t < log* b 

log(*^ (6 + c) < (log^*' b) + c. 
Proof. By induction on t, for t = it holds since by definition log'"' (6 + c) = 6 + c. Assume for t — \ and prove for t 

log(*)(6 + c) = log(*^i) (log(6- (l + c/6))) 

< log(*-i'(log6+(cloge)/&) 

< log(*' 6+ (cloge)/6 

< log^*' b + c. 

We used the induction hypothesis in the second inequality. □ 
Proof of Claim 11. 

\og^^\ai) = W*"^) (logi + loga) 

= log(*-i)(logz-(l + (loga)/logO) 

< log'*"^) (logi • e(i°s'')/'°s') 

= log'*^^^ (log log i + (log a ■ log e) / log i) 

< log^*' i + (21oga)/logi. 

The last inequaUty we use Claim 13 with b ~ log log i > log e. □ 
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